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USE OF SELECTIVE LABELING TO DETECT AND 
CHARACTERIZE MOLECULAR INTERACTIONS BY 
NUCLEAR MAGNETIC RESONANCE SPECTROSCOPY 

Priority Claim 

[0001] This application claims priority under 35 U.S.C. § 1 19(e) to United States 
Provisional Application No. 60/422,638, filed October 29, 2002, which is hereby expressly 
incorporated by reference. 

Background of the Invention 

Field of the Invention 

[0002] The present invention relates to selectively labeled molecules and the use 
of nuclear magnetic resonance spectroscopy to detect and characterize molecular interactions 
between the selectively labeled molecules and putative ligands. 

Background of the Invention 

[0003] Random screening of synthetic and natural product libraries to discover 
compounds that bind to a particular target molecule is a common method for obtaining new 
pharmaceutical entities. A target molecule is typically exposed to one or more compounds 
and assays are performed to determine if complexes between the target molecule and one or 
more of the compounds are formed. Once random screening has identified a potential 
candidate, or lead compound, analogs of the lead compound are synthesized in an effort to 
improve binding and selectivity. The analogs are then screened, usually by the same assays 
used to identify the lead compound. 

[0004] A common problem associated with current screening methods involves 
non-specific ligand binding; that is, the ligand attaches to the target molecule but in a non- 
specific manner. Such non-specific binding often occurs and is difficult to detect. In such 
cases, optimization of ligand binding is greatly complicated. The full potential for 
developing pharmaceutical-based therapies depends not only on screening vast numbers of 
compounds against a target, but also on determining specific details concerning interactions 
between the ligand and the target molecule. 



[0005] Nuclear magnetic resonance (NMR) spectroscopy is known for its ability 
to characterize macromolecular structures, and is a technique for investigating both static and 
transient features of ligand binding to a target molecule (Pellecchia, et al., Nature Rev Drug 
Disc 2002, 1 :21 1). NMR spectroscopy is a useful tool for determining the binding of ligands 
to target molecules, and has the advantage of being able to detect and quantify interactions 
with high sensitivity without requiring prior knowledge of protein function. Furthermore, 
NMR spectroscopy can provide structural information on both the target and the ligand to aid 
subsequent optimization of weak-binding hits into high-affinity leads. 

[0006] Methods of detecting binding of a ligand compound to a target 
biomolecule by generating first and second nuclear magnetic resonance correlation spectra 
from target biomolecules which have been uniformly labeled are reported in U.S. Patent Nos. 
5,698,401 and 5,804,390. The first spectrum is generated from data collected on the target 
substance in the absence of ligands, and the second in the presence of one or more ligands. A 
comparison of the two spectra permits determination of which compounds in the mixture of 
putative ligands bind(s) to the target biomolecule. 

[0007] Despite the broad applicability of using NMR spectroscopy to determine 
molecular interactions, its use with macromolecules is complicated by the extremely complex 
spectra associated with macromolecules. Although, isotope labeling in macromolecular 
NMR spectroscopy can result in increased sensitivity and resolution and in reduced 
complexity of the NMR spectra, attempts to improve NMR analysis of macromolecules by 
labeling target molecules with NMR active isotopes has been only marginally successful. 
Isotope labeling would promote the efficient use of heteronuclear multi-dimensional NMR 
experiments and provided alternative approaches to the spectral assignment process and 
additional structural constraints from spin-spin coupling. 

[0008] Current methods of assessing molecular interaction that utilize NMR 
spectroscopy typically rely on uniform labeling of the target molecule. When using uniform 
labeled target molecules, a complete and 15 N resonance assignment of at least the 
backbone nuclei of the target molecule must be completed before any ligand-protein 
interactions can be assessed by NMR spectroscopy. The assignments are then used to map 
the interactions of a ligand by following chemical shift changes upon complexation. This 
process can be quite lengthy and is generally limited to small-to-medium-sized proteins due 



2 



to line broadening and spectral overlap with larger proteins. Even with the advent of TROSY 
(Transverse Relaxation Optimized Spectroscopy) (Pervushin, et al., Proc Natl Aca. Sci USA 
1997, 94:12366; Wiithrich, Nat Struct Biol 1998, 5:492) and perdeuteration, the problem of 
complete resonance assignments in target molecules remains a major stumbling block for 
NMR ligand-binding studies. 

[0009] There have been attempts to overcome the problems associated with 
uniform labeling of target molecules by attempting to selectively label the methyl groups of 
the amino acids isoleucine, methionine, threonine, alanine, leucine, and valine. However, 
these amino acids often are present in the interior of the protein and are not involved in 
ligand or protein binding. 

[0010] Thus, there is a need for methods for determining molecular interactions 
by NMR spectroscopy that result in a simplification of the spectrum and enable the study of 
significantly larger macromolecules. Also needed are improved methods for high-throughput 
screening using NMR spectroscopy. 

Summary of the Invention 

[0011] This invention provides selectively labeled molecules and methods for 
producing these selectively labeled molecules. Also provided is a method for detecting 
ligand interaction with a target molecule. The selectively labeled molecules of the invention 
are exposed to one or more putative ligands and examined by NMR spectroscopy. The 
methods disclosed herein efficiently detect ligand binding to the labeled target molecule. 

[0012] Also provided are methods to determine the dissociation constant for the 
ligand-molecule complex. In addition, the methods described herein provide structural 
information concerning protein-molecule interactions that can be used to guide the 
optimization of the binding of a ligand. Also provided are methods for high-throughput 
screening using the selectively labeled molecules of the invention. 

[0013] In one embodiment of the invention, novel methods are provided for 
selectively labeling polypeptides by incorporation of 1 H, 13 C, ,5 N and/or 19 F into the side 
chain of one or more amino acid residues. 

[0014] The methods described herein allow studies of protein-protein interactions 
and protein-ligand interactions that are relevant to drug-design and discovery. In addition, the 
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methods can be used to characterize the binding properties of proteins with unknown 
function. 

Brief Description of the Drawings 

[0015] Fig. 1A. The 2D 13 C,1H-HMQC spectra of [4- 13 C]-Tryptophan labeled 
inhibitor of apoptosis (IAP) third BIR domain (BIR3) non-bound (apo). 

[0016] Fig IB. The 2D ,3 C,1H-HMQC spectra of [4- 13 C]-Tryptophan labeled 
inhibitor of apoptosis (IAP) third BIR domain (BIR3) complexed with an inhibitor peptide 
AVPL 

Detailed Description of the Preferred Embodiment 
[0017] Described herein are methods for selectively incorporating labels into 
target molecules that are useful for examining molecular interactions between the selectively 
labeled target molecules and putative ligands. Selective labeling permits studies with larger 
target molecules due to a lowered degree of complexity in the NMR spectra. In addition, 
selective labeling provides target molecules that are labeled in a manner previously 
unobtainable. The selective labeling disclosed herein enhances the results of NMR 
experiments, in particular NMR experiments such as NOE and TROSY experiments. 

[0018] Target molecules are any molecule capable of examination by NMR 
spectroscopy. Target molecules can be biomolecules or portions of biomolecules. Examples 
include lipoproteins, lipoprotein fragments, glycoproteins, glycoprotein fragments, proteins, 
protein fragments, and polypeptides. The terms protein and polypeptide are interchangeable 
terms. The term "protein" refers to native polypeptide structures and to fragments of the 
native structures. The term "polypeptide" includes native protein structures. Target 
molecules are preferably native protein structures, with or without post-translational 
modification. More preferred as target molecules are native protein structures that are post- 
translationally modified. 

[0019] Selective labeling is defined as labeling substantially every occurrence of 
at least one particular amino acid throughout a polypeptide sequence. Preferably, at least 
about 50% of the particular amino acid or amino acids are labeled throughout the polypeptide 
sequence. More preferably, at least about 75%, or at least about 90% of the particular amino 
acid or amino acids are labeled throughout the polypeptide sequence. Most preferably, 
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substantially each occurrence of the particular amino acid or amino acids are labeled 
throughout the polypeptide sequence. Where there is more than one occurrence of an amino 
acid that is selectively labeled in a target molecule, each labeled amino acid side chain is 
preferably labeled identically. 

[0020] Ligands are any composition that may interact with a target molecule. 
Ligands include substrates, such as enzyme substrates, agonists, antagonists, mixed agonists 
or antagonist, any biomolecule such as a polypeptide, protein, carbohydrate, lipid, and the 
like. The ligand can be naturally occurring or synthetic. 

[0021] The particular amino acid or amino acids to be labeled are preferably those 
located at a functional site of a polypeptide. A functional site is involved in ligand 
interaction, catalytic activity, conformational switches, and/or protein-protein interactions. 
Alternatively, the functional site is a cluster of amino acids whose structural, dynamic and 
physiochemical properties directly impact ligand interaction and/or transformation. 
Functional sites are often found on protein surfaces or at protein active sites. The residues 
involved in functional sites are especially likely to form binding sites and/or active sites and 
are good choices for selective labeling. 

[0022] Since functional sites undergo fewer mutations during evolution than other 
parts of a protein, functional sites can be determined by identifying sequence conservation 
patterns. This can be done by profile analysis (Gribskov et al., Methods Enzymol 1990, 
183:146), hidden Markov models (Eddy, Bioinformatics 1998, 14:755), by BLAST, such as 
PSI-BLAST (Altschl et al., Nucleic Acids Res 1997, 25:3389), or by evolutionary tracing 
(Lichtarge and Sowa, Curr Opin Struct Biol 2002, 12:21). These methods also indicate 
putative functions of the proteins. 

[0023] Preferred amino acids for selective labeling are tryptophan, arginine and 
tyrosine. Also preferred are aromatic amino acids, i.e., phenylalanine, tyrosine, and 
tryptophan. A most preferred amino acid for selective labeling is tryptophan because 
tryptophan side chains are often found at protein binding sites or hot sites and rarely occur in 
the protein interior. 

[0024] Selectively labeled molecules, preferably selectively labeled polypeptides, 
can be exposed to a second molecule and any molecular interaction can be examined by 
NMR spectroscopy. For example, 2D 13 C, ] H-HMQC (heteronuclear multiple quantum 
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coherence) and 13 C-edited 'h/H-NOESY NMR experiments can be used to detect molecular 
interaction and to determine the dissociation constant for any complex. In addition, a 
predictive model can be created based on the three-dimensional structure of the target and 
from the relative position of the ligand with respect to the labeled side chain. The use of 
several different labeled side-chains in a single, selectively-labeled, target-molecule will 
improve the resolution as well as the predictive nature of the model. 

[0025] Three amino acids that are commonly found at functional areas of proteins 
are the two aromatic residues tryptophan and tyrosine and the positively charged amino acid 
arginine. Among these, tryptophan residues are known to be the most frequently occurring 
amino acid at protein binding sites (Bogan and Thorn, J, Mol Biol 1998, 280:1). These 
amino acids are capable of making multiple types of favorable interactions that are preferred 
in the lowered effective dielectric environment of hot spots. Tryptophan, for example, can 
contribute aromatic ^-interactions, a hydrogen-bonding donor, and a large hydrophobic 
surface. Previous thermodynamic and computational studies have also pointed to the 
important energetic role of aromatic contacts, and of tryptophan in particular, in generating 
the exceptional binding free energy of streptavidin-biotin association. Crystallographic 
characterization of single site tryptophan mutants indicate that several tryptophan residues 
(Trp79, Trp92, Trpl08 and Trpl20) play an important role in the hydrophobic binding 
contributions in streptavidin-biotin interactions. Thus, one aspect of the invention is the 
selective labeling of tryptophan side chains in a target molecule and NMR spectrograph^ 
evaluation of the target molecule with putative ligands. Because of the pivotal role of 
tryptophan side chains in protein-protein and protein-ligand interactions, selective 
observation of the NMR signals from these residues provides clear and specific information 
concerning intermolecular interactions. 

[0026] In one embodiment of the invention, a labeling scheme is provided that 
incorporates one or more labels into selected amino acid side chains of a target molecule. 
For example, a method for C-labeling one or more of the carbon atoms of a side chain of an 
amino acid residue, and incorporation of the labeled residue in to an otherwise 12 C-labeled 
target molecule is provided. In addition, a method for ^-labeling of one or more atoms in a 
side chain of an amino acid residue and incorporation of the labeled residue in to an 
otherwise 2 H-labeled target molecule is provided. In addition, a method for 19 F-labeling of a 
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side chain of an amino acid residue and incorporation of the labeled residue in to a 
polypeptide is provided. In a particular embodiment, the invention provides [4- C] and/or 
[2- 13 C] nuclei in tryptophan side-chains in an otherwise 12 C-labeled target molecule. In 
another embodiment of the invention the selective incorporation of 19 F in tryptophan side 
chains at positions 5 or 6 of a target molecule is also provided. By similar methods, other 
nuclei, such as 15 N, can also be incorporated. 

[0027] Thus, an efficient and cost-effective method for selective incorporation of 
labeled amino acid residues into target molecules is provided. Preferred labels are atoms 
having unpaired nuclear spins. Examples of such atoms are •h, ,3 c, ,5 n, ,9 f, 3, p. Target 
molecules are preferably labeled with *H, 13 C, l5 N, 19 F. More preferred, the target molecules 
are selectively labeled with *H , 13 C, or 19 F, and any combination of 19 F, ! H and/or 13 C. 




tryptophan 

Scheme 1. Chemical structure of tryptophan. (Numbering according the IUPAC 

nomenclature). 

[0028] Selective incorporation of amino acid residues into polypeptides, for 
example recombinant proteins expressed in bacteria, preferably Escherichia coli, can be 
achieved by employing the appropriate growth media (minimal media) supplemented with 
the desired labeled amino acid(s) (Kay and Gardner, Curr Op Struct Biol, 1997, 7:722). 
Most of the 20 natural amino acids are> commercially available with different isotope 
compositions, obtained by chemical synthesis or purified from labeled algae extracts. 
Common commercial suppliers include Cambridge stable isotopes, Spectra Stable Isotopes, 
and ISOTEC-Sigma. Alternatively, the appropriate growth media can be supplemented with 
the appropriate labeled precursors of the given amino acid, based on the biosynthetic 
pathways of the given amino acid. For applications in ligand binding studies and drug- 
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discovery, the method of choice depends on several factors such as the degree of 
incorporation of the selective label, the degree of scrambling of the wanted labels into other 
amino acids or positions, and the ability to obtain desired amounts of selectively labeled 
protein in a cost-effective manner. 

[0029] Means for preparing expression vectors that contain polynucleotide 
sequences coding specific polypeptides and for transforming host cells with those vectors are 
well known in the art. (See, for example, Old et al., "Techniques of Gene Manipulation," 
Blackwell Science, London, 1994, and similar treatises in the field). Likewise, methods for 
culturing the transformed cells to express the coded polypeptide and for isolating, purifying 
and re-folding the polypeptide are also well known in the art. 

[0030] The shikimic acid pathway is the key metabolic pathway in the 
biosynthesis of the aromatic amino acids - tyrosine, phenylalanine, and tryptophan - which 
occurs in bacteria. The initial step of the pathway is the condensation of erythrose-4- 
phosphate (from the pentose phosphate pathway) and phosphoenolpyruvate (from 
glycolysis). The product of the condensation reaction is then cyclized and reduced to form 
the intermediate compound shikimate, with the phenolic ring structure characteristic of the 
aromatic amino acids. Combination of shikimate with an additional molecule of 
phosphoenolpyruvate produces chorismate, from which alternative pathways lead to either 
phenylalanine/tyrosine or to tryptophan. 
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[0031] The biosynthesis of tryptophan from chorismate involves four 
intermediates: anthranilate, ribosylanthranilate, 1 '-(O-carboxyphenylamino)-l 

deoxyribulose-5-phosphate, and indoleglycerol phosphate. The biosynthesis occurs via two 
interconnected pathways that lead to indole (and/or indole-3-glycerolphosphate) that are 
readily converted into tryptophan via the enzyme tryptophan-syntase (Crawford, Meth Enz 
1987, 142:293). In the last step of the pathway serine serves as the donor of the a-carbon 
amino group of tryptophan (Scheme 2). 




Indole 

Scheme 2. Tryptophan biosynthetic pathway in bacteria. G-3P = D-glyceraldehyde-3- 
phosphate; TS-A = tryptophan-synthase subunit A; TS-B = tryptophan-synthase, subunit B. 

[0032] The biosynthesis of tyrosine from chorismate takes place by two 
alternative routes. The first intermediate in both routes is prephenate, which is converted to 
either 4-hydroxyphenylpyruvate or arogenate. Tyrosine is then produced from each of these 
intermediates. The biosynthesis of phenylalanine from chorismate also takes place by two 
alternative routes. The first intermediate in both routes is also prephenate, which is 
converted to either phenylpyruvate or the common intermediate arogenate. Phenylalanine is 
then produced from each of these intermediates. 

[0033] In one embodiment, labeled indole is added to a recombinant bacteria, 
engineered to express a target molecule of interest, prior to induction of protein expression. 
A target molecule is then produced having all its tryptophan residues labeled. Similarly, 
labeled 4-hydroxyphenylpyruvate or labeled phenylpyruvate can be added to a recombinant 
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bacteria, engineered to express a target molecule of interest, prior to induction of protein 
expression. The target molecule is then produced having its tyrosine or phenylalanine 
residues labeled. When one or more labels are incorporated into one or more positions of 
indole, 4-hydroxyphenylpyruvate, or phenylpyruvate the resulting target molecules have 
substantially all of the corresponding amino acid side chains labeled. When ! H is selectively 
incorporated into the target molecule, the recombinant bacteria engineered to express a target 
molecule of interest is grown on media containing 2 H prior to induction of protein 
expression. 

[0034] The methods of the invention are exemplified with aromatic amino acids, 
but the methods can be adapted to the other amino acids based on the biosynthetic pathways 
of the given amino acid. For example, when selectively labeling aromatic amino acids, 
labeled precursors and specific inhibitors of the particular amino acid are used. Likewise, 
when selectively labeling non-aromatic amino acids, labeled precursors and specific 
inhibitors of the particular amino acid are used. For example, arginine is synthesized by 
bacteria from glutamate. Thus, adding a labeled arginine precusor, such as ornithine, 
citrulline, or argininosuccinate, and a selective inhibitor of one of these steps to a 
recombinant bacteria, engineered to express a target molecule of interest, prior to induction 
of protein expression, will produce a target molecule having its arginine residues labeled. 
Alternatively, labeled arginine and a similarly labeled arginine precursor can be added to a 
recombinant bacteria engineered to express a target molecule of interest to produce a 
selectively labeled target molecule. 

[0035] Selectively labeled target molecules produce greatly simplified NMR 
spectra. Moreover, selective 13 C labeling of target molecules has the additional advantage of 
permitting use of the aromatic-TROSY effect that is optimal at magnetic fields corresponding 
to 600 MHz *H frequency (Pervushin et al., Proc Natl Acad Sci USA 1997, 94:12366). 
Selective *H labeling is particularly useful in larger target molecules, as the deleterious l H- l H 
dipole-dipole interactions will be mostly replaced by the less efficient 2 H- ! H dipole-dipole 
interactions. Elimination of the ^-'H dipole-dipole interactions enables the study of very 
large target molecules by NMR spectroscopy. In addition, measurements of intermolecular 
distances in complexes by nuclear Overhauser effects (NOEs) can be readily obtained using 
the selectively ^-labeled target molecules. In addition, 2D l H 9 x H NOESY and/or 13 C- 
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resolved (or edited) H, H NOESY pulse sequences can be used to determine intermolecular 
distances using the selectively ^-labeled target molecules. 

[0036] In one embodiment, the invention provides an efficient and cost-effective 
method for incorporation of 2- 13 C-tryptophan and/or 4- 13 C-tryptophan in to target molecules. 
Details of the method are provided below in Example 1. The method uses 2- 13 C indole 
and/or 4- ,3 C indole as precursors for tryptophan synthesis in bacteria, preferably in 
Escherichia coli. The labeled indole precursors are commercially available from suppliers 
such as Cambridge Isotopes Laboratories. 

Example 1 

[0037] Escherichia coli strains comprising an inducible vector for a given protein 
were grown in minimal growth media containing 12 C-, ^-glucose as the sole carbon source 
in addition to salts and other nutrients in amounts necessary for bacterial growth prior to 
induction. Once the bacteria reached an acceptable mass, measured, for example, in terms of 
optical density at 600 nm of the growth media (OD 6 oo), protein expression was induced. At 
this point, the bacterial growth is minimal and the metabolism of the bacteria is shifted 
towards the biosynthesis of proteins and consequently of amino acids. Addition of small 
amounts of 2- 13 C-indole or 4- 13 C-indole to the growth media resulted in efficient 
transformation of these precursors into labeled tryptophan, via the tryptophan-synthase 
pathway, which was incorporated into various target molecules. Because indole can be 
transformed into tryptophan in a single step by the bacteria, and because of its unique 
chemical structure, being quite distinct form any other common metabolite, there was no 
scrambling of the label in the target molecule when various labeled indoles were used as 
precursors. 

Example 2 

[0038] In an additional embodiment of the invention, selective *H labeling of 
tryptophan side chains (with or without 13 C labeling) was achieved by perdeuteration of the 
target molecule by replacing water with 2 H 2 0 in the growth media, and supplementing indole 
(or 13 C-indole) to the media to obtain a target molecule that is uniformly 2 H-labeled with the 
exception of selectively l H-labeled tryptophan side chains. This selective ^-tryptophan 
labeling is particularly useful in larger proteins, as the deleterious 'lI-'H dipole-dipole 
interactions will be mostly replaced by the less efficient 2 H- ! H-tryptophan dipole-dipole 
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interactions. Elimination of the H- H dipole-dipole interactions enables the study of very 
large proteins and/or protein complexes by NMR spectroscopy. In addition, measurements 
of ^-tryptophan - ligand distances in complexes via ^-tryptophan/ 1 H-ligand NOEs can be 
readily obtained by using the selectively ^-labeled target molecules. In addition, 2D l H 9 l H 
NOESY and/or 13 C-resolved (or edited) NOESY pulse sequences can be used to 

determine intermolecular distances by using the selectively ! H-labeled target molecules. 

Example 3 

[0039] The incorporation of 19 F-labeled amino acids into target molecules has 
been reported. For example, supplementation of growth media with the 19 F-labeled amino 
acids has been reported. However, not only was the level of 19 F incorporation was too low to 
be useful, the location of the 19 F nucleus in the aromatic amino acid was not uniform (Sixl et 
al., Biochem J 1990, 266:545). Simple supplementation of growth media with the I9 F- 
labeled amino acids suffers from the additional problem that 19 F- tryptophan is toxic to strains 
that are wild-type with respect to tryptophan biosynthesis. 

[0040] Another reported method attempted to incorporate 19 F-labeled aromatic 
amino acids into target molecules by blocking tryptophan biosynthesis with glyphosate, 
which is an inhibitor of the 3-enolpyruvyl-shikimate-5-phosphate synthase reaction of 
aromatic amino acid biosynthesis (Kim et al., FEBS 1990, 272:34). Glyphosate, however, 
completely inhibits the entire synthetic pathway for all aromatic amino acids, so that these 
need to supplemented to the media as well to ensure bacterial growth. 

[0041] Another embodiment of this invention provides several methods to 
efficiently incorporate 19 F-labeled amino acids into polypeptides. In one embodiment of the 
invention the tryptophan residues of a polypeptide are 19 F labeled. For example, recombinant 
bacteria engineered to express a target molecule were incubated with 19 F-tryptophan and 
indoleacrylic acid, an inhibitor of tryptophan-synthase. Indoleacrylic acid is quite similar to 
tryptophan and earlier reports on the antibiotic properties of this compound on certain 
bacterial strains were attributed to the inhibition of biosynthesis of tryptophan, as 
accumulation of indole was detected (Fildel, Br J Exp Pathol 1945, 26:416). In another 
embodiment of the invention, recombinant bacteria engineered to express a target molecule 
were incubated with 19 F-tryptophan, 19 F-indole (with the same fluorinated position as 19 F- 
tryptophan) and serine. In this way, biosynthesis of tryptophan is either inhibited by ,9 F- 
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indole or results in synthesis of additional 19 F-tryptophan. Similarly, tryptophan with other 
labels, such as ] H, 15 N, 13 C, can be incorporated into a selectively labeled target molecule. 

Example 4 

[0042] Programmed cell death, or apoptosis, occurs naturally during the 
development and maintenance of animal tissues and organs. During these processes more 
cells are produced than are required for building tissues and organs. The unwanted cells are 
programmed to die, either because the chemical signals that direct them to go on living are 
suppressed or because they receive a specific signal to die. Suppression of apoptosis is 
associated with a variety of diseases such as neurodegenerative disorders and the 
uncontrolled cell growth in leukemia and other cancers. 

[0043] One class of proteins that negatively regulates cell death signaling is the 
inhibitor of apoptosis proteins (IAPs). Members of this family are characterized by having 
one or more baculovirus IAP repeats called BIR domains, which consist of approximately 70 
amino acids that contain the characteristic signature sequence CX2CX16HX6C. 

[0044] IAPs are known to bind to and inhibit the cysteine proteases known as 
caspases, which play a key role in the execution of programmed cell death. Caspase 
inhibition by the IAPs can directly explain their antiapoptotic activities. Human X-linked 
inhibitor of apoptosis protein (XIAP) inhibits caspase-9. The BIR3 domain of XIAP is 
responsible for the potent inhibition of caspase-9. Of the approximately 70 amino acids of 
the BIR3 domain, only Glu-314, Trp-310, and His-343 have been shown to have a large 
effect on caspase-9 inhibition (Sun et al., J Biol Chem 2000, 275:33777). 

[0045] Selectively labeled BIR3 was prepared according to the methods of this 
invention. In particular, 5- 19 F-tryptophan-labeled BIR3 and 6- I9 F-tryptophan-labeled BIR3 
were prepared by the methods described herein. The fluorinated analogs 5- 19 F-tryptophan 
and 6- I9 F-tryptophan are commercially available from Sigma. One-dimensional 19 F NMR 
spectra of 5- 19 F-tryptophan BIR3 reveals the presence of fours peaks, one for each 
tryptophan in the sequence. The yields of 19 F incorporation into the target compounds were 
compared by using mass spectrometry. The results from these experiments showed that the 
use of indoleacrylic acid results in higher 19 F incorporation for the four tryptophan residues 
in BIR3. In fact the incorporation was nearly 100% efficient. 
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[0046] Fluorinated analogues of aromatic amino acids are particularly useful 
because the variations in chemical shift of the fluorine residues are sufficiently large to 
permit identification of individual resonances, even in relatively large proteins. Additional 
advantages resulting from use of the 19 F nucleus are that I9 F occurs in 100% natural 
abundance and the sensitivity is close to that for the proton. 

[0047] Figure 1 shows a two-dimensional 13 C, ] H HMQC spectra recorded on a 
Varian Unity+ 500 spectrometer with a 100 sample of 4- ,3 C-tryptophan labeled BIR3 
domain. Four tryptophan residues are present in the sequence of the protein. The asterisk in 
Figure IB indicates the tryptophan residue involved in interaction with smac N-terminal 
tetrapeptide region. 

[0048] The selectively labeled 5- 19 F-tryptophan and/or 6- I9 F-tryptophan target 
molecules are utilized in ID 19 F NMR and heteronuclear ^F/H-NOESY NMR experiments 
to analyze ligand interactions such as ligand binding. The selectively labeled target 
molecules are also utilized to determine the dissociation constant for target molecule-ligand 
complexes, and to provide models based on the three-dimensional structure of the target and 
the relative position of the ligand with respect to labeled side-chain(s). These experiments 
are quite sensitive because of the high magnetic susceptibility of 19 F. 

Example 5 

[0049] The dissociation constant, KD, for a given ligand and its target molecule 
can be determined by generating a first NMR spectrum of a specifically labeled target 
molecule; exposing the target molecule to various concentrations of a ligand; generating a 
NMR spectrum at each concentration of ligand employed; comparing the spectra generated to 
the first spectrum of the target molecule; and calculating the dissociation constant between 
the target molecule and the ligand from those differences. 

[0050] The screening method can be used to determine the dissociation constant 
of one ligand of the target molecule in the presence of a second molecule. The process of 
determining the dissociation constant of a ligand in the presence of a second bound ligand is 
performed by mixing the selectively labeled target molecule to a ligand before exposing the 
target molecule to the test compounds. The screening method is additionally able to provide 
information regarding the binding of a second or subsequent ligand to the target molecule. 
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This second ligand may be chemically linked to the first ligand bound to the target molecule, 
thus providing a new composite molecule for use in affecting the target molecule. 

[0051] In one embodiment of the invention, the selectively labeled target 
molecules are used in high-throughput screening. For example, a plurality of putative 
ligands are exposed sequentially or simultaneously to a selectively labeled target molecule. 
In another embodiment, a ligand is exposed to a plurality of selectively labeled target 
molecules, either sequentially or simultaneously. In another embodiment, a plurality of 
ligands are exposed to a plurality of labeled target molecules. In cases where a plurality of 
labeled target molecules are used, it is preferred to use targets with substantially non- 
overlapping spectra. 

[0052] When more than one compound is screened for binding to a target 
molecule, for example a mixture or a library of compounds, and where a difference arises 
between the first spectrum generated from the target molecule alone and that generated from 
the target molecule in the presence of compound(s), additional steps are performed to 
identify which specific compound or compounds contained in the mixture is actually binding 
to the target molecule. Those additional steps include exposing the target molecule 
individually to each compound of the mixture; generating a NMR spectrum of the labeled 
target molecule that has been individually exposed to each compound; and comparing each 
spectrum to the first spectrum generated from the target molecule alone to determine 
differences in any of those compared spectra. The differences in the spectra facilitate the 
identification of a compound that is a ligand. 

[0053] In one embodiment of the invention, an orphan gene is cloned and its 
corresponding protein is selectively labeled. Based on sequence homology data, ligands are 
selected and screened for interaction. Positive interaction between the ligand and protein 
validates the sequence homology analysis, while negative interactions invalidate the 
sequence homology analysis. 

[0054] While the present invention has been described in some detail for purposes 
of clarity and understanding, one skilled in the art will appreciate that various changes in 
form and detail can be made without departing form the true scope of the invention. All 
figures, tables, and appendices, as well as patents, applications, and publications, referred to 
above, are hereby incorporated by reference in their entireties. 
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